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ARTICLE INFO ABSTRACT 

Keywords: This article investigates the relations among the tip-of-the-tongue (TOT) state, event related 
Tip-of-the-tongue potentials (ERPs) to correct feedback to questions, and subsequent memory. ERPs were used to 
ERP investigate neurocognitive responses to feedback to general information questions for which 
EEG 


participants had expressed either being or not being in a TOT state. For questions in which 
participants were unable to answer within 3s, they indicated whether they were experiencing a 
TOT state and then were immediately provided with the correct answer. Feedback during a TOT 
state, as opposed to not knowing the answer, was associated with enhanced positivity over 
centro-parietal electrodes 250-700 ms post-feedback, and this enhanced positivity mediated a 
positive relationship between TOTs and later recall. Although effects of increased semantic access 
during TOT states cannot be ruled out, these results suggest that information received during TOT 
states elicits enhanced processing—suggestive of curiosity—leading to enhanced learning of 
studied material. 


Learning 
Memory 


1. Introduction 


The tip-of-the-tongue (TOT) state is often a highly frustrating phenomenon. Described as “a gap that is intensely active” (James, 1890, 
p. 251) or as “personal introspections of inner turmoil when grappling for an elusive word," (Brown, 1991, p. 205) TOT states are defined 
as the simultaneous feeling of being unable to recall an item, and confidence that one could recall, given enough time (Schwartz, 2006). 
According to metacognitive models, the TOT state is analogous to a “low tire pressure” signal in a car in that it alerts one about a problem 
(failed recall); yet it is not the problem itself (Schwartz & Metcalfe, 2011). While a low tire pressure signal serves to remind car owners to 
inflate their tires, the function of the TOT state is more opaque. As TOT states have been associated with curiosity for un-retrieved 
information (Metcalfe, Schwartz, & Bloom, 2017), it may be the case that TOTs function as a signal to amplify processing when feedback, 
in the form of the target of the failed recall attempt, is provided. In the present research, we investigated whether TOT states function to 
aide learning, and whether such learning is mediated by enhanced processing of external feedback. 

Apart from curiosity specifically, TOTs have been identified in several contexts as motivating cognitive processes. One possible 
function for TOTs is to drive retrieval attempts, and indeed several studies have found that individuals devote more resources towards 
retrieving items while they are experiencing TOT states (Kikyo, Ohki, & Sekihara, 2001; Schwartz, 2001). For example, individuals 
spent more time attempting to recall answers while in TOT states than when they were sure they did not know the answers (Schwartz, 
2001). When participants were given a secondary working memory task in addition to TOT-eliciting cues, performance was poorer on 
the secondary task for trials in which TOT states were experienced, suggesting greater devotion of cognitive resources to memory 
search processes (Ryan, Petty, & Wenzlaff, 1982). 
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In addition to motivating retrieval efforts, TOT states have also been suggested to drive curiosity as operationalized by answer- 
seeking (i.e., either looking up the answer or asking for the answer). Litman, Hutchins, and Russon (2005) gave participants a series 
of 12 general information questions, and for each asked participants whether they knew the answer, they didn’t know the answer, or 
they were in a TOT state. When participants were later allowed to open envelopes containing answers to these same questions, the 
researchers reported that participants were more likely to choose questions about which they had earlier been in a TOT state than 
those about which they were not in a TOT state. These findings were limited by the small number of questions used per participant 
(and the small number of TOT states) and by the long time period between the experience of being in the TOT state and the choices. 
In a subsequent study (Metcalfe et al., 2017) using many more questions (82) with very little time to attempt retrieval (resulting in all 
participants experiencing TOT states), and also requiring participants to choose immediately — while they were still in a TOT state — 
participants were approximately twice as likely to choose to see the TOT state-related answers than the non-TOT state answers, 
indicating that TOTs are associated with elevated curiosity for target items. 

Indeed, TOTs seem to fit within Berlyne’s (1954) theory of epistemic curiosity, which posits that the items that are almost known 
are the ones which most spark human interest. Epistemic curiosity’s function is thought to be a motivator to close the gap between 
what one knows and what one desires to know (Berlyne, 1954, 1960; Litman & Spielberger, 2003; Loewenstein, 1994), with items 
that are perceived to be almost known being those most likely to provoke such curiosity. Indeed, when individuals were presented 
with learning tasks, they chose to spend more time studying items that were within a “region of proximal learning,” (RPL) which is 
defined as consisting of those as-yet-unlearned items with the highest judgments of learning (Metcalfe & Kornell, 2005; Metcalfe, 
2002). In addition, individuals mind-wandered less when studying items they judged to be within their RPL, as compared to items 
that were perceived to be either too easy or too difficult (Xu & Metcalfe, 2016). It may be possible to conceptualize the TOT state as a 
marker that the unrecalled information resides within an RPL-like domain: between information that is already well known and that 
which is completely unfamiliar. It thereby elicits a high degree of curiosity. 

Curiosity has also been linked to enhanced subsequent memory. In an fMRI study, Kang et al. (2009) administered general in- 
formation questions to participants and probed for their curiosity on each question. When the experimenters provided participants with 
correct answer feedback, answers to curiosity-evoking questions were associated with activation in the left inferior frontal gyrus (IFG) 
and left parahippocampal gyrus (PHG), regions believed to be active in verbal memory encoding (Paller & Wagner, 2002; Wagner et al., 
1998). In a follow-up behavioral experiment, feedback following incorrect answers about which the participant was curious was better 
recalled than feedback about errors about which the participant had not been curious. While the researchers did probe participants as to 
whether they experienced TOTs, participants did not report enough TOT states for analysis under this paradigm. 

If the TOT state is, itself, a marker of curiosity, then memory may also be better for feedback provided when the learner is in a 
TOT state. Supporting this idea, Gardiner, Craik, and Bleasdale (1973) found improved subsequent memory for items for which 
participants had been in a TOT, compared to those items for which participants were not in a TOT. They suggested that TOT states 
enhance processing, either through increasing the amount of attention devoted to feedback or by activating more semantic attributes 
of the target word. MacKay and Burke (1990) have suggested that TOTs occur when considerable semantic information about a target 
is available, and yet the phonological information is insufficient to allow production of the name (also see, Burke, MacKay, Worthley, 
& Wade, 1991). It is well documented that individuals have partial information about the target when they are in a TOT state: they 
can often recall the first letter (Brown & McNeill, 1966; Koriat & Lieblich, 1974), the number of syllables (Burke et al., 1991), and 
even the grammatical gender of the word (Miozzo & Caramazza, 1997). Furthermore, given sufficient time they can sometimes recall 
the target (Cohen & Faulkner, 1986). Thus, as noted by Gardiner et al. (1973), it is likely that this partial information, as well as TOT- 
related increases in attention, contributes to the high rates of recall for TOT-evoking items. We further explore this relation between 
TOT states and subsequent memory, and the possibility that people's event-related potentials (ERPs), while receiving feedback when 
they are either in a TOT state or not, may shed light on this relation. 

There is a substantial body of work showing that distinct neural patterns during learning can be found based on whether items are 
successfully recalled in a subsequent test. In particular, ERPs for later remembered items tend to display heightened late positivity 
during learning compared to those that are forgotten (Fabiani, Karis, & Donchin, 1990; Friedman & Johnson, 2000; Friedman & Trott, 
2000; Friedman, 2003; Friedman, Ritter, & Snodgrass, 1996; Paller, Kutas, & Mayes, 1987; Paller, McCarthy, & Wood, 1988; Sanquist, 
Rohrbaugh, Syndulko, & Lindsley, 1980). This enhanced processing for subsequently remembered items, named the difference in 
memory effect (Dm; Paller et al., 1987), has been found with a variety of learning tasks, and with both words (Fabiani & Donchin, 
1995; Fabiani, Karis, & Donchin, 1986) and pictures (Friedman & Sutton, 1987) as stimuli. 

While Dm effects show a consistent pattern such that subsequently remembered items are associated with more positive ERP 
amplitudes, the latency and scalp distributions of such effects vary with the task and stimuli such that the effect has been recorded 
over frontal (Friedman, 1990; Miinte, Heinze, Scholz, & Kiinkel, 1988; Paller & Kutas, 1992; Sanquist et al., 1980), central (Paller, 
1990), and parietal electrodes (Paller & Wagner, 2002; Paller et al., 1987; Weyerts, Tendolkar, Smid, & Heinze, 1997). In addition, 
while some Dm effects resemble the P300 component elicited by novel items (Wagner, Koutstaal, & Schacter, 1999), the heightened 
positivity for subsequently recalled items in many cases persists much longer, up to 900-1000 ms post-stimulus, than the typical P300 
(Friedman & Sutton, 1987; Smith, 1993). Furthermore, as demonstrated by Friedman (1990), the subsequent memory effect may be 
dissociated from the P300 with which it overlaps by differences in scalp distribution between the two components. The heightened 
ERP amplitudes for later remembered items are thought to reflect enhanced encoding, as items subject to deep semantic processing 
display larger Dm effects than items processed in a shallow non-semantic fashion (Paller et al, 1987). Further, Wagner et al. (1999) 
suggested that in some instances, subsequent memory effects depend on the existence of prior knowledge that individuals can 
incorporate into their representations of the current stimuli. Because TOT states are thought to involve a high degree of prior 
knowledge (Schwartz & Metcalfe, 2011) or semantic access (Burke et al., 1991), it may be the case that the differential processing of 
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feedback to TOT items, as compared to no-TOT (and unknown) items, could resemble Dm effects previously found in the literature. 

Several neurocognitive studies have found evidence for heightened monitoring, executive control, or continued memory search 
during TOTs, as compared to instances when a target was either recalled or unknown (for review, see; Diaz, Lindin, Galdo-Alvarez, & 
Bujan, 2014; Schwartz & Diaz, 2014). In particular, fMRI studies indicate heightened TOT-related activation in areas associated with 
monitoring and executive control such as the dorsolateral prefrontal cortex and anterior cingulate cortex (Maril, Simons, Weaver, & 
Schacter, 2005; Maril, Wagner, & Schacter, 2001). Additionally, an ERP study by Diaz, Lindin, Galdo-Alvarez, Facal, and Juncos- 
Rabadan (2007), showed that when people were shown pictures of faces and then tasked with coming up with the names, from 0 to 
450 ms the ERP signal was not different between recalled and TOT items. However, there was a difference later, such that the ERP 
amplitude of the recalled items (and, indeed, non-recalled items) was higher than that of the TOT items — a result that is probably 
related to differences in response resolution (Burke et al, 1991; Schwartz & Metcalfe, 2011). While these studies allow us to better 
understand the neural processes during the TOT state itself, to our knowledge no neurocognitive studies have been conducted 
examining processing of feedback while in the TOT state. 

In the current study, we explored, using ERPs, whether the TOT state is associated with heightened processing of feedback to 
general information questions. We examined whether subsequent memory is enhanced for items that have previously been associated 
with TOT states. We also asked whether there were distinct ERP signatures for processing of feedback in TOT states. In our ex- 
periment, we asked participants a series of questions, probed them as to whether they were in a TOT state, then provided the correct 
feedback for each question. Afterwards, participants took a memory test on the same questions as before. ERPs time-locked to the 
correct feedback given immediately after the TOT probe were compared as a function of whether participants reported being in a TOT 
state or not during that question trial. We hypothesized that memory for the feedback to questions in which the participants had 
reported TOT states would be better than that for feedback to questions in which the participants had said they were not in a TOT 
state. Further, we predicted that feedback to TOT items would evoke distinct neural signatures indicative of enhanced processing, 
similar to Dm effects previously observed in the literature. 


2. Method 
2.1. Participants 


The 30 participants recruited (16 male, 13 female, one who did not report gender) were Columbia University students, ranging in 
age from 18 to 32 (M = 21.27 years; SD = 4.26). One participant did not report experiencing any TOT states during the TOT/ 
feedback phase and was discarded from analyses prior to EEG preprocessing. According to a criterion of > 5 useable EEG trials each 
following “TOT” and “no-TOT” responses during the TOT/feedback phase, 3 participants were excluded during EEG preprocessing, 
leaving 26 total participants for all analyses. Participants were recruited with community flyers and emails from lists of previous 
study participants. All participants reported themselves to be right-handed, fluent English speakers, in good physical health, and were 
free from medications known to affect the central nervous system. Participants were paid $15/hr for their time. The Columbia 
University and New York State Psychiatric Institute institutional review boards approved the project, and all participants provided 
written, informed consent. 


2.2. Materials 


The stimuli were 150 general information questions taken from the Nelson and Narens (1980) norms, and updated and corrected 
(see, Metcalfe, Casal-Roscum, Radin, & Friedman, 2015). For example, one question was “What is the capital of Jamaica?” (Answer: 
Kingston). The questions were displayed with PsychoPy2 in a large white font with a gray background on a computer monitor (Peirce, 
2007). 


2.3. Design 


The experiment was a within-subjects design, where all participants were presented with a set of general information questions for 
an initial question presentation/TOT judgment/feedback phase (subsequently referred to as the TOT/feedback phase), then retested 
on the same questions in a surprise retest phase (test phase). Order of presentation of questions was randomized for each participant 
for both the TOT/feedback phase and the test phase. EEG was recorded during the TOT/feedback phase only. In conducting our 
analyses, we treated whether participants experienced TOT states in the TOT/feedback phase as if it were an independent variable 
with two levels (TOT vs. no-TOT, coded as 1 and 0, respectively). 


2.4. Procedure 


After the EEG caps were applied, participants were instructed that they would be given a series of general information questions 
to try to answer verbally and, if they could not answer within the allotted time, to report whether they were in a TOT state or not. 
Participants were also told that they would get to view the answers to all the questions for which they could not generate an answer in 
the allotted time. Although participants nearly always knew what a TOT state was (also see Schwartz & Metcalfe, 2011), for each 
participant, the experimenter explained the TOT state as a phenomenon in which “you feel sure you know the answer and think you 
can get it—it is imminent—but you cannot think of it at the moment.” Participants completed a practice trial before the experiment 
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began, and were not informed that there would be a test phase at the end of the experiment. 

For each trial in the TOT/feedback phase, the general information question first appeared onscreen and was read aloud by the 
experimenter, who immediately upon finishing hit a key to start the 3-s response period. During this period, if the participant gave 
any verbal answer, whether correct or not, the experimenter hit a key to move on to the next question immediately without providing 
feedback. At the end of the 3-s period if no response had been given, participants indicated whether they were in a TOT state or not. 
Then, a fixation cross appeared on the screen for 500 ms, followed by a 1000 ms duration feedback (correct answer) to the question 
for that particular trial. A jittered 750-1250 ms blank screen was shown before each subsequent trial. 

Following the TOT/feedback phase, participants were given a 2-minute break in which the EEG cap was removed, then given 
instructions on how to complete the test phase. During the test phase, participants saw the same 150 general information questions 
for a second time (in a different random order), and were given 15s per question to type a response. Participants were instructed to 
type their best guess to each question even if they were not sure of their answers. Feedback was not provided during the test phase, 
and participants’ responses were scored using an approximate pattern matching algorithm (AGREP, generalized Levenshtein edit 
distance = .1, Ripley & Hornik, 2010), then checked by an experimenter blind to the TOT/feedback phase data. At the end of the 
experiment, after filling out a demographics sheet, participants were thanked, debriefed, paid for participation, and given the op- 
portunity to see the answers to all of the general information questions if they wished. 


2.5. Electroencephalographic (EEG) recording and preprocessing 


Brain electrical activity was recorded during the TOT/feedback phase from 62 scalp sites (sintered Ag/AgCl) mounted in an 
Electrocap (Neuromedical Supplies) and digitized at 500 Hz (DC; high-frequency cut-off of 100-Hz; right-forehead ground). Electrodes 
were placed on the outer canthus of each eye to record horizontal eye movements, and directly above and below the left eye for vertical 
movements. Activity was originally referenced to the nose and re-referenced offline to the average of the left and right mastoids. 
Impedances were maintained below 10k throughout the experiment. Prior to analyses, all recordings were filtered using a 0.1-25 Hz 
IIR-Butterworth bandpass filter to remove DC drift and muscle movements. Electrodes (between 1 and 4 per subject) were interpolated 
for 13 of our participants due to abnormal patterns or excessive fluctuation. Interpolation within subjects was done across all trials. 

Offline artifact rejection and independent component analysis (ICA, Delorme & Makeig, 2004; Makeig & Onton, 2011) were used 
to remove eye blinks, eye movements, muscle activity, and other noise from the data. A subsequent round of artifact rejection was 
also conducted in which single trials containing voltage deviations of over 50 nV from baseline were removed from analysis. As no 
feedback was presented for trials in which participants verbally recalled the answers during the 3s response period after question 
presentation, these trials were not included in the EEG analyses. Our preprocessing pipeline yielded a total of 2022 (648 TOT, 1374 
no-TOT) trials for EEG analysis across all 26 participants included (by participant: Mror trials = 24.9, SD = 12.6; Myo-ror 
Trials = 92.8, SD = 18.6). 

Numeric integration was used to calculate the average area under the curve over the measurement time window of interest (i.e., 
negative amplitudes were subtracted from positive amplitudes and integrated over the measurement interval) using EEGLAB and 
ERPLAB (Delorme & Makeig, 2004; Lopez-Calderon & Luck, 2014). All ERP analyses were conducted on mean amplitudes equivalent 
to dividing the integral of the waveform by the length of our time window. A 200 ms pre-stimulus baseline was used. ERPs were time 
locked to presentation of the correct answer during the TOT/feedback phase of the experiment. Electrodes for our analyses were 
determined based on scalp topographies, and the most active electrodes from 250 to 700 ms post-onset across all trials were the focus 
of our analyses (Fig. 2). Analyses focused primarily on this time window, as we were interested in semantic processing, rather than 
sensory components occurring before 250 ms (Luck, 2014). Based on these criteria, we selected 9 centro-parietal electrodes for 
analysis: Cz, Cl, C2, CPz, CP1, CP2, Pz, Pl, & p2'. 


3. Results 


For the analyses computed, the criterion for statistical significance was set at p < .05. For all ANOVAs, the Greenhouse-Geisser 
procedure (Jennings & Wood, 1976) was used to correct the degrees of freedom in order to account for violations of the homogeneity 
of variance assumption. 


3.1. Behavioral results 


3.1.1. TOT states 

Of the 26 participants included for analyses, the average proportion of questions eliciting TOT states was .22 (SD = .08). The 
average proportion of questions eliciting a ‘don’t know’ response was .46 (SD = .13). All trials during the TOT/feedback phase in 
which the participant gave an answer (average proportion = .32, SD = .15) were excluded from both behavioral and ERP analyses, as 
no feedback was presented for these trials. 


7 Because the most active centro-parietal region was slightly right-lateralized, we also completed the same analyses with 12 electrodes corresponding to the 
lateralization (Cz, C1, C2, C4, CPz, CP1, CP2, CP4, Pz, P1, P2, & P4) and our findings remained the same. 
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Fig. 1. Within-subjects scatter plot. Dots are individual subject recall probabilities given an initial No-TOT trial (x-axis) vs. an initial TOT trial (y- 
axis). Error bars are + 2 standard errors. 


3.1.2. Post-test recall 

Participants’ typed responses were scored for accuracy. Recall probability in the test phase for all items, including questions where 
participants initially responded verbally during the TOT/feedback phase and saw no feedback, was .58 (SD = .15). Participants’ 
recall was better for trials in which they had been in the TOT state (M = .75, SD = .11) than those in which they had simply not 
known the answer (M = .52, SD = .19; t(25) = 8.64, p < .0001, see Fig. 1). 


3.2. Electrophysiology 


3.2.1. TOT versus no-TOT analysis 

Tip-of-the-tongue state was associated with increased ERP amplitudes, at centro-parietal electrodes, 250-700 ms post-feedback-onset. 
We conducted a 2 x 9 repeated measures ANOVA of ERP amplitude using TOT state (TOT vs. no-TOT) and Electrode (Cz, C1, C2, CPz, 
CP1, CP2, Pz, P1, & P2) as within-subjects factors. Degrees of freedom were adjusted using the Greenhouse-Geisser procedure. There was a 
main effect of TOT state, such that participants showed greater amplitude to feedback for TOT trials than for no-TOT trials (F(1, 
25) = 21.77, p < .0001; Fig. 3). Overall ERP amplitudes did not differ significantly across the 9 electrodes chosen for analysis (F(2.04, 
51.1) = 2.12, p = .130), and electrode location did not interact significantly with TOT state (F(3.17, 79.24) = 1.37, p = .257). 


3.2.2. Dm analysis 

ERP amplitudes to feedback differed as a function of whether an item was subsequently correctly recalled or unrecalled. Fig. 4 
displays the grand average waveforms at cento-parietal electrode sites during feedback in the TOT/feedback phase as a function of 
subsequent recall in the test phase. An ANOVA (Greenhouse-Geisser adjusted) revealed a main effect” for subsequently remembered 
trials on averaged ERP amplitudes in our time window of 250-700 ms, such that amplitudes were greater for TOT/feedback phase 
trials in which the correct feedback was subsequently recalled in the test phase (F(1,25) = 26.31, p < .0001). In this model ERP 
amplitudes did not differ significantly across the 9 electrodes chosen for analysis (F(1.57, 39.34) = 2.25, p = .129), and electrode 
location did not significantly interact with subsequent recall (F(2.72, 67.99) = 1.41, p = .248). 


3.2.3. TOT versus no-TOT conditional upon recall 

It is well documented in the literature (and was shown in our data in Fig. 4) that items that are subsequently recalled reveal a 
higher amplitude late positivity than items that are subsequently forgotten. Furthermore, TOT items are more likely to be recalled 
than non-TOT items. In an attempt to examine whether it was only this difference in ease of recall, and the increased information 
associated with the target item at time of encoding, that was responsible for the difference between the TOT and non-TOT ERP 
patterns, or whether the TOT state itself might also have contributed, we conducted an additional analysis in which we used only the 
items that were subsequently recalled. These items were divided into those that were associated with TOTs and those that were not. 
Even when all of the items contributing to the analysis were recalled, there was still a main effect of TOT state, as is shown in Fig. 5: 
ERP amplitudes in the 250-700 ms window were greater for feedback to TOT trials than for no-TOT trials (F(1, 25) = 4.84 p = .037). 


? In order to weight individual subjects by their respective number of included trials and create a more easily interpretable mapping between TOT states, ERPs, and 
learning, we conducted further analysis (see Single-Trial Analysis & Fig. 6) using a model in which single-trial ERP amplitude was used as a predictor for subsequent 
recall. 
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Mean Voltage (pt V) 


Fig. 2. Scalp topography displaying grand average mean voltage (uV) from 250 to 700 ms post-feedback onset, calculated for all trials across all 
participants. 
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Fig. 3. ERPs time-locked to feedback onset during the initial TOT/feedback phase. ERPs to feedback showed enhanced late positivity in centro- 
parietal electrodes in TOT trials (red) compared to no-TOT trials (black). Shading = 95% confidence interval at each time point. N = 26. (For 
interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.) 


Overall ERP amplitudes did not differ significantly across the 9 electrodes (F(1.69, 42.154) = 2.34, p = .116), and electrode location 
did not interact significantly with TOT state (F(3.31, 82.69) = 1.19, p = .565). 


3.2.4. Single-trial analysis 

ERP amplitudes in response to feedback predict subsequent recall. To further examine the relationship between processing of TOT 
feedback and subsequent recall, we isolated the single-trial ERP data for each participant using custom MATLAB and R scripts. For 
each trial, the ERP amplitude was averaged between 250 and 700 ms post-stimulus across the 9 electrodes of interest. We then 
regressed’ the correct/incorrect post-test answers on feedback ERP amplitude, whether the participant was in a TOT, and their 


3 We estimated a multilevel logistic regression with the R package brms (Biirkner, 2017; R Core Team, 2017) built on Stan (Gelman et al., 2015). We allowed the 
effects of TOT and amplitude to vary across participants, and used software-provided uninformative priors on all parameters. 
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Fig. 4. ERPs to feedback showed enhanced late positivity for trials in which the item was subsequently recalled in the test phase. N = 26. 


interaction (Fig. 6). Consistent with the previous results, TOT state positively predicted probability of subsequent recall such that 
recall was better for TOT than no-TOT trials (B = .94, 95% CI = [0.63, 1.24])*. Single-trial ERP amplitude also positively predicted 
probability of subsequent recall (8 = .04, 95% CI = [0.03, 0.06]). There was no interaction between TOT state and ERP amplitude 
(B = .01, 95% CI = [—0.02, 0.04]). 

To take into account effects of individual electrodes, we conducted a similar regression in which we did not average across 
electrodes, but instead allowed effects of TOT and amplitude to vary across both participants and our 9 electrodes of interest. This 
model produced very similar predictions to the first, such that TOT state (8 = 1.03, 95% CI = [0.59, 1.51]) and ERP amplitude 
(B = .04, 95% CI = [0.01, 0.07]) both positively predicted recall probability. There was no interaction between TOT state and ERP 
amplitude (B = .01, 95% CI = [—0.02, 0.05]). 


3.2.5. Mediation analysis 

The above results suggested a causal model according to which feedback presented during a TOT state elicits greater positive 
centro-parietal ERP responses, and this ERP magnitude in turn leads to a higher probability of correctly recalling the item at a later 
test. To formally address this causal model, we performed a multilevel mediation analysis, with TOT as the independent variable, 
single-trial ERP (averaged across electrodes) as the mediator, and binary recalled/not recalled as the dependent variable° (Vuorre & 
Bolger, 2017; Vuorre, 2017). This mediation analysis (see Fig. 7) indicated that ERP amplitude partially mediated (indirect ef- 
fect = 0.10, 95% CI = [0.04, 0.11]) the relationship between TOT states and subsequent recall (proportion mediated effect = 0.093, 
95% CI = [0.036, 0.16]). A direct effect of TOT states on subsequent recall remained after taking ERP amplitudes into account 
(c’ = 1.02, 95% CI = [0.74, 1.29]). Although the mediation effect was partial, this finding lends support for a model in which the 
effect of TOT states on subsequent recall is mediated by ERP amplitudes. 


* Note that the B for TOT state is the effect, in log-odds, of a categorical predictor coded as a 1 for TOT trials and 0 for no-TOT trials. B for ERP amplitude is the 
effect, in log-odds, of a continuous predictor with microvolts (uV) as units. 

5 This analysis was conducted using the same measure as our previous ERP analyses; single-trial averaged amplitude 250-700ms post-stimulus at centro-parietal 
electrodes. We modeled the binary outcomes with a logistic link function—therefore the relevant effects are in log-odds. We again used software-provided unin- 
formative priors for all parameters. 
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Fig. 5. ERPs to TOT feedback only on items that were subsequently recalled. N = 26. 


4. Discussion 


This experiment examined whether TOT states were associated with enhanced learning, and whether this relationship was 
mediated by neurocognitive processing of feedback, as measured through ERPs. In addition, we tested whether subsequent recall was 
improved for items in which TOTs had been evoked during the TOT/feedback phase. If neural processing of feedback mediated the 
relationship between TOT states and recall, we expected that feedback-evoked ERPs in TOT trials (compared to no-TOT trials) would 
be increased in amplitude, and that this increased amplitude would positively predict subsequent recall probability. The data indeed 
showed that TOT states preceding the feedback phase predicted subsequent recall, and that this relationship was partially mediated 
by ERP amplitudes over centro-parietal sites. Although our experimental design precludes direct claims of causality, our results 
support a model in which TOT states facilitate learning through heightened processing of feedback. 

With our single-trial analyses we asked whether ERP amplitudes predicted subsequent recall. We found that items evoking higher 
average amplitudes over central-posterior electrodes in the TOT/feedback phase were more likely to be recalled in the test phase, and 
that this ERP measure mediated the relationship between TOT states and subsequent recall. All subjects appeared to similarly show 
this mediation effect (see $1). Consistent with the mediation model, higher ERP amplitudes were observed for feedback to TOT trials, 
compared to no-TOT trials, even between items in which the target was subsequently recalled. These results provide convergent 
support for a framework in which TOT states themselves contribute to improved learning due to enhanced neurocognitive processing 
of feedback. 

The difference in ERP waveforms evoked by feedback in TOT versus no-TOT trials was highly similar to that observed for ERPs to 
subsequently remembered versus forgotten items in the Dm literature. While Dm effects do vary in both location and latency de- 
pending on the task, the enhanced late processing at centro-parietal electrodes that we observed in response to feedback during TOT 
states resembles the neural signatures evoked during encoding of subsequently remembered items in several studies employing 
semantic tasks (Friedman & Johnson, 2000; Friedman et al., 1996; Paller & Wagner, 2002; Paller et al., 1987; Van Petten & Senkfor, 
1996). While we wish to be cautious about the extent to which we equate our TOT findings with Dm effects, the Dm literature can act 
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NO TOT 
TOT 


Single-Trial Mean Amplitude (uV) 


Fig. 6. Multilevel logistic regression model for post-test recall probability using single-trial EEG data. Amplitude averaged across the time window 
(250-700 ms post-feedback onset) was measured in microvolts (uV). Thick lines = population-level regression line; thin lines = subject-specific 
regression lines; shading = 95% CI of the population-level regression line. N = 26. 


me = 0.1 [0.04, 0.17] 
c= 1.1 [0.84, 1.4] cite tennis 
pme = 0.093 [0.036, 0.16] 

cov(a,b) = -0.005 [-0.039, 0.02] 


a=2.3 b = 0.047 
1.3, 3.3] [0.03, 0.065] 
SD = 1.3 SD = 0.024 
[0.099, 2.6] [0.0016, 0.055] 
c=1 
TOT (Y/N) nee P(Recall) 
[0.023, 0.7] 


Fig. 7. Path diagram of the multilevel mediation model. c = total effect (direct + indirect effect of TOT on P(Recall), me = mediated effect, 
ce’ = direct effect, pme = proportion of effect that is mediated, cov(a,b) = covariance of subject-level a and b parameters. Parameters are reported 
with 95% credible intervals in square brackets. 


as a useful backdrop for understanding how TOT states predict greater encoding of feedback. Notably, our finding that ERP am- 
plitudes predicted subsequent memory (independent of TOT states) for our task is itself a true Dm effect—subsequently remembered 
items display enhanced late processing in response to feedback. 

We may also be able to contextualize the relationship between TOTs and subsequent recall within previous work on ‘hy- 
percorrection’. In paradigms involving general information questions, high-confidence errors result in better learning of feedback, 
compared to low-confidence errors (Butler, Fazio, & Marsh, 2011; Butterfield & Metcalfe, 2001, 2006; Metcalfe et al., 2015; Metcalfe, 
2017). Hypercorrection effects have been suggested to stem from heightened attention during the feedback following the error, as 
well as to the semantic similarity of the errors to the corrective feedback. The former postulated mechanism for the enhanced memory 
was supported both by evidence that the P3 component was increased to high-confidence error feedback (Butterfield & Mangels, 
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2003; Metcalfe et al, 2015) and by participants’ failure to detect auditory signals in a secondary task during such feedback 
(Butterfield & Metcalfe, 2006). These latter results suggest that, in addition to the informational difference (Finn & Metcalfe, 2007; 
Metcalfe & Finn, 2011; Sitzman, Rhodes, Tauber, & Liceralde, 2015), there may be a feedback loop during hypercorrection that 
recruits attention and strengthens encoding processes (Butterfield & Metcalfe, 2006; Metcalfe, 1993). Further, items evoking high- 
confidence errors may reside within an individual’s region of proximal learning and thus signal that it will be beneficial to invest 
effort into learning them (Metcalfe & Finn, 2011). 

In the case of the TOT state specifically, it may be that heightened curiosity for the target information engages attention, boosts 
encoding, and leads to improved recall. In this model, the TOT state acts as a ‘gain control’ to amplify attention to and encoding of 
feedback, increasing the likelihood of subsequent memory. The enhanced encoding due to the TOT state thus spurs learning of 
feedback. Such a model is also consistent with predictions that TOT-evoking items sit within an individual’s region of proximal 
learning, and thus recruit heightened attention to feedback. Indeed, similar work has suggested that there is an increase in processing 
at parietal electrodes when participants were attentive to and focused on learning, relative to when they were mind wandering (Xu, 
Friedman, & Metcalfe, 2018), further supporting the idea that participants might have been more attentive to feedback during TOT 
states. 

It should be noted, though, that the observed differences in recall and in the ERPs to feedback might not be due exclusively to 
enhanced attentional processing. These differences might also be attributed in part to the likelihood that participants possessed more 
prior information for answers in which they experienced TOT states. Both metacognitive models such as Schwartz and Metcalfe’s 
(2011) heuristic model and linguistic models such as Burke et al.’s (1991) models based upon Node Structure Theory (and see, Burke 
& Shafto, 2004), indicate that TOT experiences arise when there is considerable accumulated information from a variety of sources 
(syntactic, partial semantic, phonetic, or cue-based information). Subsequent memory is also strongest when there is a high degree of 
prior information available for the items being learned (Paller et al., 1987; Van Petten & Senkfor, 1996). It follows that enhanced 
memory for feedback to TOT states would be expected based only on prior information. Interpretation of the extent to which our 
results are due to curiosity and increased attention as compared to only the availability of semantic information remains a challenge: 
both components probably contributed to our effects. 

While it is very likely, then, that differences in semantic access contributed to our results, the finding (1) of a mediating role of 
ERP amplitudes and (2) that ERP amplitudes were enhanced more for TOT as compared to non-TOT related feedback even when only 
items that were later recalled were examined provides support for the idea that enhanced processing, rather than only differences in 
the amount of information, plays a role in the learning associated with TOT states. To more directly address whether TOT states 
influence ERPs and subsequent learning above and beyond the effects of semantic access, future experimental designs are needed in 
which TOT states are evoked while the degree of semantic access is held constant. 

Given the experimental design in the present study, we were only able to directly compare learning and ERPs to feedback for 
items where participants did not overtly provide an answer (‘TOT’ versus ‘Don’t Know’ responses). Future studies are needed to 
compare long-term recall and neurocognitive responses to feedback following TOT states versus successful retrieval. In addition, the 
current design did not investigate ‘blocked’ TOTs, in which an intruding word occurred (Brown & McNeill, 1966; Burke et al., 1991; 
Jones & Langford, 1987; Kornell & Metcalfe, 2006). Research is needed on whether the ERP signatures and learning for blocked TOTs 
are the same or different from those shown here. 

While the results of our study indicate that TOT states may facilitate learning when correct feedback is provided, we do note that 
it has also been suggested that the effects of TOTs may not be uniformly positive: they may result in error learning through implicit 
learning of an unsuccessful retrieval attempt (D’Angelo & Humphreys, 2015), and recurrence on a second test (Warriner & 
Humphreys, 2008). It is certainly possible that TOTs help learning through an initial provocation of curiosity, yet may have some 
harmful effects if no resolution is attained despite an extended time period. Nevertheless, the finding that being in a TOT state when 
correct feedback is provided is associated with better subsequent memory, as was found here, and that the neural correlates point to 
enhanced processing of the feedback, may have valuable applications for learning. More broadly, research on how initial failed 
retrievals and initial errors impact subsequent memory may contribute to the development of new strategies for improving learning 
in both educational and therapeutic settings. 
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